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(57) ABSTRACT 

A cache system is described that includes a storage that is 
partitioned into a plurality of storage areas, each for storing 
one kind of objects received from remote sites and to be 
directed to target devices. The cache system further includes 
a cache manager coupled to the storage to cause objects to 
be stored in the corresponding storage areas of the storage. 
The cache manager causes cached objects in each of the 
storage areas to be replaced in accordance with one of a 
plurality of replacement policies, each being optimized for 
one kind of objects. 
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WEB CACHE PERFORMANCE BY quickly forwarded to the client device directly from the 

APPLYING DIFFERENT REPLACEMENT proxy server 16. This eliminates delays encountered in 

POLICIES TO THE WEB CACHE communicating between the proxy server 16 and the remote 

server 18. By storing copies of objects received from remote 

BACKGROUND OF THE INVENTION 5 sites, the proxy server 16 reduces the number of requests that 

1. Field of the Invention are directed to the remote server 18, as well as the traffic on 
The present invention pertains to data access network Int< ; met 20 a T** 1 ° f ttinsinittin 8 me responses in the 

systems (e.g., Internet/Intranet systems). More particularly, form of a number of P ackcts must be ^assembled at the 

this invention relates to partitioning a cache such that client device 12. 

different cache replacement policies are employed for dif- 10 Ideally, the cache at the proxy server 16 retains all of the 

ferent storage areas of the cache that store different kinds of cached objects. However, the typical storage capacity of the 

objects to improve performance of the cache. P roxv server 16 & in me ra nge of 256 megabytes to 1 

2. Description of the Related Art terabyte, with most Web proxy capacity being at the lower 
As is known, access to Web documents via the Internet is 15 ha * ° f the /^- before it is important to form a 

generally structured around the Hyper Text Transfer Proto- 15 rc P la f men *™W ° r for dctenomiEg which 

!Li rtJTTD\ til uttd * i • * j objects are to be evicted or replaced from the cache of the 

.^VSh r T ,V req T , - aDd - resp0DS ^ proxy server 16 when a recently received object is to be 

protocol. When a user at a client device (e.g., a personal u j u * j , • 

„ . A A . t .. , „, . , i * cached within exhausted storage space. Two important met- 

computer) designates a particular Web page, at least one ^ ^ m ^ measure * J ^ are 

requests generated. Tie number of requ^ts ,s dependent 20 cache M fate ^ fc ft £ ^ ^ ^ 

upon the sophistication of the designated Web page. Often, . c „ ' .... . £ 

wi u ■ * j /> l i?j 4 ii l percentage of all user requests that are satisfied by the proxy 

a Web page is formed from a number of data files, such as r t f , ^ ... . . . 7 ^ A ™ 7 

. - «i u- ci j * « » « .j d n L r server 16, rather man by access to the original server 18. The 

text files, graphics files, audio files, and video files. Each of . . ... , . tU \ * n * i * «= j 

*u «i • f « l- a n- u- * • byte hit rate is the percentage of all network traffic, measured 

the files is referred to as an object . A multi-object page is . , . , c j j- *i *i_ ^/r 

# . „ , . . . I # . J F & t in bytes, transferred directly from the proxy server 16, 

aesthetically pleasing, but each object requires a separate *u ^ ; * j 

4 / r , ™ J r A . - r . 25 instead of across the external network, 

request and a separate response. Therefore, the time for each _ 

request-and-response round trip plays a role in determining ™ ere are a number of cache replacement strategies or 

the total time a user must wait to view the complete Web P ollcies ^ have been proposed for Web proxy caching. 

p a g e However, the cache m the prior art proxy server typically 

. , . „, . uses only one replacement policy to manage the content of 

The total latency in downloading a Web page or other 30 iU u * • 1 u 1 ♦ r x. 

j , . A A rrrn C1 \ 5 j l f the cache. As is known, each replacement policy has its 

Internet document (e.g., a FTP file) depends on a number of A . , , , K. . t r e i j 

««i a- *u A f • • a t ■ «• advantages and disadvantages, and is suitable for one kind 

factors, including the transmission speeds of communication P . . 7 . , u . & f ,, . , v . 

, 1 • , j • , r . ... of objects. This means that each of the replacement policies 

links between a client device and a server on which the ^ t c *u u • *u 

4 j c, • . , /• 4 , \ j . . * , improves the performance of the cache in the proxy server 

requested file is stored (i.e., content server), delays that are r*\ , • , n ... rnia , . ... /. „^. A 

■ j . .* • . • . j j 1 t° r a chosen metnc (e.g., hit rate, byte hit rate, or latency), 

incurred at the server m accessing the document, and delays 35 c , , ' v c - *• 1 * 

, . . . *. . . . . 4 . ' * ^ For example, a replacement policy of evicting the largest 

incurred at any intermediate device located between the , 4 / I u r n j * \ 

..... ; , t , . # r« . 4 ... document (rather than a number of smaller documents) may 

chent device and the content server. The intermediate . v . . .. . . . , . , \ . r 

j • 1 j . , improve cache hit rate, but does not improve the byte hit 

devices may include proxy servers and routers that are t r A A . . . r r f . .. 

between the client devices and the content server that stores f ' M * ^ exa, »P l6 ' a «P la « m f. nl P^X °/ evict ? n 8 

t1 _ 117 , £1 * ^ 4 , Ci c „ T . the smallest document may increase the byte hit rate, but 

the Web page file. An important benefit of a Web proxy 40 j ♦ ■ *u u w* * 

■ *u u*v* * u u- * • . £• 1 does not improve the cache hit rate, 
server is the ability to cache objects received from the 

remote content servers. This allows the cached objects to be SUMMARY OF THE INVENTION 

quickly retrieved and sent to the client devices if they are ^ _ , .... 

again requested. As is known, some of the cached objects ° nc future of the present invention is to increase the 

may be requested by the same or different client device at 45 P erformance of a cache s y stem * 

later times. The caching operation of the Web proxy server Another feature of the present invention is to increase the 

will be described below, with reference to FIG. 1. performance of a cache system by allowing multiple 

As can be seen from FIG. 1, when a client device or user ^placement policies to be applied to the cache system, 

terminal 12 generates a request for a particular object (e.g., A further feature of the present invention is to increase the 

the object 10 stored in the remote server 18), the cache of the 50 performance of a cache system by partitioning the cache 

proxy server 16 in the local server 14 is searched to system into a number of storage areas, each for storing one 

determine whether the object 10 is stored at the proxy server ^ nd °f objects, such that multiple cache replacement 

16. If the object is not found in the cache of the proxy server policies, each optimized for one kind of objects, can be 

16, a "cache miss" results and the local server 14 directs the applied to the cache system. 

request to the remote server 18 via the Internet 20. As can 55 A cache system includes storage that is partitioned into a 
be seen from FIG. 1, the remote server 18 stores the plurality of storage areas, each for storing one land of 
requested object 10. Once the remote server 18 receives the objects received from remote sites and to be directed to 
request, it directs a response with the requested object 10 to target devices. The cache system further includes a cache 
the client device 12 via the local server 14. During this manager coupled to the storage to cause objects to be stored 
process, the requested object 10 is also cached in the proxy 60 in the corresponding areas of the storage. The cache man- 
server 16 of the local server 14. This eliminates the need for ager causes cached objects in each of the storage areas to be 
the local server 14 to send another request to the remote replaced in accordance with one of a plurality of replace - 
server 18 for the same object 10 at a later time when either ment policies, each being optimized for one kind of objects, 
the same client device 12 or a different client device (not A cache system includes storage that either includes a first 
shown) requests for the same object 10. When the object 10 65 and a second storage device coupled together, or is parti- 
is again requested, the proxy server 16 is accessed and a tioned into the first and second storage devices. Each of the 
"cache hit" results. In this case, the cached object 10 is storage devices stores objects received from the remote sites 
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and to be directed to the target devices. The cache system the communication link 70. The external network can be, for 
also includes a cache manager coupled to the first storage example, the Internet or Intranet network, 
device to send received objects to the first storage device. The communication modules 62 and 64 may be any of a 
The cache manager causes cached objects in the first storage variety of available communication means for routing corn- 
device to be replaced in accordance with a first replacement s municadons using known switching or routing mechanisms, 
policy. The replaced objects are then sent to the second For example, the client-side communication module 62 may 
storage device. The cache manager causes objects stored in be an array of modems for channeling Web page objects to 
the second storage device to be replaced in accordance with personal computers via conventional telephone lines. The 
a second replacement policy different from the first replace- network-side communication module 64 may be a switching 
ment policy. io c ^ rci " t f° r connecting to an external network via a Tl 
- , , - Al _ communication link. In one embodiment, the external net- 
Other features and advantages of the present invention WOfk is ^ rommunicatioils nctwork commonly rc f C rred to 
will become apparent from the following detailed as fatemet or mtranet . 

description, taken in conjunction with the accompanying tu- m jcn • ui * * w l_ 

, ' . . . J - , , . K , 7 ° The proxy 60 is able to access or receive Web pages 

drawing* illustrating by way of example the principles of and/or other objects stored in remote servers (not shown) via 

the invention. 15 ^ communication link 70. The way the proxy 60 receives 

the objects for caching is done in the same manner as has 
been described above with reference to FIG. 1. As described 



BRIEF DESCRIPTION OF THE DRAWINGS 



FIG. 1 shows a prior art Internet/Intranet system that uses above with reference to FIG. 1, a request from the client 

a proxy server to provide objects, such as Web documents, device 12 is initially satisfied by a response with the object 

to client devices. 20 ^ ^ rom ^ c remote server 18. The object 10 is also retained 

prp i r • j v. in the cache of the proxy server 16 of the local server 14, if 

FIG. 2 shows the sfructure of a proxy in accordance with desired so ^ a f&w'wquMt (either from the same client 

one embodiment of the present invention. device n Qr a differeQt cufint device) for ^ same object 10 

FIG. 3 shows in more detail the structure of the cache may be satisfied directly from the proxy server 16. The 

manager of the proxy of FIG. 2. 25 proxy 60 of FIG. 2 is used in a similar manner as the proxy 

FIG. 4 shows the structure of another proxy in accordance server 16 of FIG. 1 to receive and cache objects received 

with another embodiment of the present invention. bom remote sites via the communication link 70. 

FIG. S shows the structure of yet another proxy in c Jf t be seen . firom L FI< ?: 2 > the P™*? 60 L includes £f 

accordance with a further embodiment of the present inven- £? ch6 . 7 ? f ° r £«? ^ ^jeets received in the proxy 60^ 

r 30 The objects that are stored may be components of a Web 
page, such as audio files, HTML files, graphic files, or video 

DETAILED DESCRIPTION OF THE *^ es * °bj ects mav a k° De other files that are transferred 

INVENTION v * a tne Internet, such as File Transfer Protocol (FTP) trans- 
fers. In accordance with one embodiment of the present 

FIG. 2 shows the structure or configuration of a proxy 60 35 invention, the cache 72 is logically partitioned into a number 

that implements one embodiment of the present invention. of storage areas 73.73^ E ach of the storage areas 73-73/1 

The proxy 60 can be used as a proxy server m an Internet/ can be of tbe samc size or different sizes. In addition, the size 

Intranet system for cachmg objects received from a remote of eacb of me storage areas 7 ^_ 73n can be dynamically 

site to a client device connected to the proxy server. In changed or remain unchanged after the initial partition, 

addition, the proxy 60 may also be used in other 4Q Dynamic allocation of sizes allows some storage areas to 

applications, such as in CPU or other cache management, increase in size by shrinking the size of other storage areas, 

garbage collection, etc. Thus, the proxy 60 may also be a p proac h allows the storage areas that are very busy in 

referred to as a cache system or proxy server. caching objects t0 use more space mat fc aUocated t0 the 

As will be described in more detail below, the proxy 60 other inactive storage areas. This allows the cache 72 to 

includes a cache 72 and a cache manager 74. In accordance 45 adjust to changes in the workload without requiring human 

with one embodiment of the present invention, the cache 72 intervention. 

is logically partitioned into a plurality of storage areas 73 one embodiment, the size of each of the storage areas 

through 73n. Each of the storage areas 73-73* is used to 73-73/t is dynamically assigned and changed. In another 

store one kind of objects. For example, the storage area 73 embodiment, the size of each of the storage areas 73-73* is 

is used to store objects of small size while the storage area 50 set statically and remains unchanged after initial partitton- 

73a is used to store objects of large size. The storage area mgi Alternatively, the storage areas 73-73/1 can be initially 

73n may be, for example, used to store very popular objects. se t at the same size and then dynamically changed during the 

The cache manager 74 is coupled to the cache 72 to cause operation. The storage areas 73-73/j can also be initially set 

objects to be stored in the corresponding storage area of the at different sizes and then dynamically changed during the 

cache 72. The cache manager 74 also causes the cached 55 operation, or not changed. 

objects in each of the storage areas 73-73/1 to be replaced in Each of the storage areas 73-73/1 is used to store objects 

accordance with one of a number of replacement policies, of one kind. For example, the storage area 73 can be used to 

each being optimized for one kind of objects. The proxy 60 st0 re objects that have high impact on one performance 

will be described in more detail below, also in conjunction metric (e.g., hit rate) while the storage area 73a is used to 

with FIGS. 2 and 3. 60 s t orc objects that have high impact on another performance 

As can be seen from FIG. 2, the proxy 60 includes a metric (e.g., byte hit rate). The storage area 73n can, for 

client-side communication module 62 and a network-side example, be used to store objects that are extremely popular, 

communication module 64. The client-side communication The objects to be cached in the cache 72 can also be sorted 

module 62 is used to interface the proxy 60 with client in accordance with other categories. For example, the 

devices (not shown in FIG. 2) via the communication link 65 objects can be distinguished as image objects, video objects, 

66. The network-side communication module 64 is used to large objects, small objects, or extremely popular objects, 

connect the proxy 60 to an external network (not shown) via etc. 
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The cache 72 is controlled by the cache manager 74. The virtual memory systems, it does not work well if the cached 

cache manager 74 receives the objects to be cached from the objects are only popular during certain times of a day or 

communication module 64 and sends the objects to the certain days of a month 

cache 72. The cache manager 74 also manages the contents A Md re lacement u ^ ^ ^l-Frequentiy 

stored in the cache 72. This is because some objects are c t1m j nnr* „i *u »i. * i ,u i * . • , J 

un-cachable, such as secured objects and file forms that are ? Scd ^ ^ g ° nU ? m rc P laccs , thc oh * C * which has 

completed by inputs from a user at a client device prior to b f 4 en * CCCSS f d ^ least number ° f umes ; This strategy 

transfer from a server to the client device. The cache attempt to keep more popular objects and replace rarely 

manager 74 has the ability to distinguish cachable objects used ob J ects - However > s °me objects can build a high 

from non-cachable objects. The cache manager 74 deter- n frec 3 UCDc y count ovc r a short period of time and be rarely 

mines which objects are to be stored in which of the storage 10 acce **d after the subject matter is no longer "hot". Such 

areas 73-73*. The cache manager 74 also tracks the location ob J ects often T ? ( m ™ ^in me ad * lon & ^ ob j ects 

of a stored object such that when the object is requested, the are no lon % cr " hot " 

cache manager 74 can send the request to the appropriate A third replacement policy uses the size of the cached 

storage area or can retrieve the requested object from the objects as the measurement, which replaces the largest 

appropriate storage area. As can be seen from FIG. 3, the cached object. This strategy tries to minimize the miss ratio 

cache manager 74 includes a controller 82 and a partition ov replacing one large object rather than many smaller ones, 

lookup table 81. The partition lookup table 81 tracks which However, some of the small objects brought into the cache 

of the storage areas 73-73/1 stores what kind of objects. The mav never De accessed again. The Size strategy does not 

controller 82 accesses the partition lookup table 81 when- provide any mechanism to evict such objects, which leads to 

ever it receives an object to be stored in the cache 72 or a pollution of the cache. This Size policy has a poor byte hit 

request for a cached object. rate since it discriminates against large-sized objects. 

A more important role of the cache manager 74 is to A fourth replacement policy is the Greedy-Dual-Size 

determine which objects are to be evicted from the cache 72 (GDS) replacement policy. This policy replaces the object 

when the cache 72 is detected in need of a replacement ^ with the lowest key value for a certain utility/cost function, 

operation. The cache 72 typically requires a replacement When an object i is requested, it is given a priority key K, 

operation when, for example, the capacity of the cache 72 which is computed as follows: 
has been exhausted. Thus, when a recently received object 

is eligible for caching, the cache manager 74 determines K r cjs,+L 

which of the previously cached objects is to be evicted from 30 where c ^ me ^ assoc i ate d with bringing the object i into 

the cache 72. If the cache manager 74 determines that the me ^ chc q can be determined using a number of different 

recently received object is likely to be less "popular 1 ' than cost fr^ns md corresponding performance metrics. S, is 

any of the presently cached objects, the object remains me object size ^ L is a running age factor that starts at 0 

un-cacned. ^ ^ updated for each replaced (evicted) object f to the 

The cache manager 74 controls the eviction of cached 35 priority key of this object in the priority queue (i.e., L~K,). 

objects in the cache 72. Each storage area of the cache 72 is The GDS policy outperforms previously known caching 

refreshed (i.e., evicting cached objects stored in the cache) strategies with regard to a number of metrics, including 

in accordance with one of a number of replacement strate- cac h e hit ratio and byte hit ratio. However, the GDS algo- 

gies or polices. Each replacement policy is optimized for one nthm or policy does have at least one shortcoming, namely, 

performance metric. For example, when the storage area 73 40 that the algorithm does not take into account how many 

stores objects that have high impact on the hit rate, a tj mes tbe cac hed object was accessed in the past, 

replacement policy that is optimized for the hit rate is used a fifth policy is a LFU-Aging replacement policy that 

for the storage area 73. This allows multiple replacement replaces the least frequently used object, avoiding cache 

policies to be simultaneously applied to the cache 72. This pollution. The LFU component of the replacement policy 

also allows the cache 72 to cache objects of different kinds 45 maintains a frequency count for each object in the cache, 

while offering optimized performance across several perfor- jhe aging component of the replacement policy avoids 

mance metrics at the same time. cac h e pollution by reducing the frequency count of each 

The hit rate and byte hit rate are the two main (or most cached object by a factor of two whenever the average 

common) performance metrics used to measure perfor- frequency count exceeds a threshold parameter. Since some 

mance of proxy cache such as the cache 72. As described so objects may be extremely popular, they might stay in the 

above, the hit rate measures the number of requests satisfied cache for longer than desired or have significant influence on 

from the proxy cache as a percent of total requests. The byte the current frequency count. This policy retains popular 

hit rate measures the number of bytes that the proxy cache objects for longer time periods and is able to discriminate 

served directly as a percent of the total number of bytes for against those objects which are only accessed once while in 

all requests. To get the best hit rate, it is more "profitable" ss the cache (i.e., extremely unpopular objects), 

to replace one large object than many small objects. To get A sixth policy is a LRU-Log (size) replacement policy 

the best byte hit rate, larger files must be considered. that is similar to the LRU replacement policy. The policy 

There are a number of replacement policies or strategies replaces the object that has the largest log (size) and is the 
that can be used for proxy caching. However, each of the least recently used object among the same log (size) object, 
policies is optimized for one chosen metric (e.g., hit rate, 60 A seventh policy is a Lowest-Latency-First replacement 
byte hit rate, or latency) and has its own advantages and policy that removes the object with the lowest download 
disadvantages. For example, a well known replacement latency. This policy is designed to minimize average latency, 
policy is the Least-Recently-Used (LRU) algorithm. An eighth policy is a Lowest-Relative-Value (LRV) replace- 
According to this policy, when an eviction is required in ment policy that includes the cost and size of an object in the 
order to store a recently received object, the previously 65 calculation of a value that estimates the utility of keeping an 
cached object that was requested least recently is evicted. object in cache. This replacement policy evicts the object 
Although this policy is a simple policy that is effective for with the lowest utility value. This policy also achieves high 
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hit rate, but requires significant cache space as it does not 
consider size at all. This policy also has the ability to age the 
object set to prevent cache pollution. 

A ninth replacement policy is a Greedy-Dual-Size with 
Frequency (GDSF-Hits) replacement policy. This policy is 
the improvement of the above described GDS replacement 
policy. This policy reflects object access patterns by incor- 
porating a frequency count F,- in the computation of K,. such 
that 

When C ( -=l for all i, this policy achieves very high hit rates. 

A tenth replacement policy is a Least Frequently Used 
with Dynamic Aging (LFU-DA) replacement policy. The 
difference between this policy and the LFU-Aging policy 
described above is that this policy does not require the 
setting of parameters. This is done by replacing the tuneable 
aging mechanism of the LFU-Aging policy with a dynamic 
mechanism (i.e., the inflation factor L) such that the key 
value K, for object i is calculated as follows: 

When C— 1, this policy achieves similar hit rates and byte hit 
rates to the LFU-Aging replacement policy. In addition, this 
dynamic aging mechanism is more efficient computationally 
than that of the LFU-Aging policy. 

Alternatively, other known cache replacement policies 
may also be used for the cache 72. All of the replacement 
policies are stored in a replacement policy database 83 of the 
cache manager 74 and can be retrieved by a replacement 
controller 84 of the cache manager 74 to be applied to the 
appropriate ones of the storage areas 73-73/1 to replace the 
cached objects. Each of the components 81-84 of the cache 
manager 74 can be implemented using known means. 

One of the advantages of partitioning the cache 72 into a 
number of storage areas 73-73/1 and allowing each storage 
area to be applied with one replacement policy is that 
different performance metrics can be achieved simulta- 
neously on a single cache 72. Another advantage of parti- 
tioning is that the scheme works well in the situation of 
non-uniform distribution. For example, some objects are 
extremely popular and account for a large fraction of all 
requests (and potential cache hits). Caching just these few 
objects results in great improvements in performance. 
However, once these extremely popular objects have all 
been cached, it becomes increasingly difficult to identify 
objects that will have a significant impact on performance 
and thus large increases in cache space are required to 
further improve the performance of the cache. In addition, 
the number of the popular objects may also vary from time 
to time. Through the use of partitioning, the cache 72 is able 
to cache this set of extremely popular objects with dynamic 
size, thus achieving most of the performance of a larger 
cache and still having cache space remaining to store objects 
that have more impact on a different performance metric. 
Another advantage of partitioning is the different reference 
patterns between request streams. For example, the refer- 
ence patterns for image objects are quite different from the 
reference patterns for video objects. Ibis means that parti- 
tioning protects image objects from being replaced from the 
storage area where the video objects are added. 

FIG. 4 shows another cache proxy 100 in accordance with 
another embodiment of the present invention. As can be seen 
from FIG. 4, the cache 102 is comprised of a number of 
storage or cache devices 105-105* connected together. 
Alternatively, the cache 102 is only logically partitioned into 
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a number of storage areas. Each of the cache devices (or 
partitioned storage areas) 105-105/j stores objects of one 
kind. The cache device 105 is connected to a cache manager 
104. The cache manager 104 sends all objects for caching in 
the cache 102 to the cache device 105. In addition, the cache 
manager 104 sends aU requests for cached objects to the 
cache device 105. 

Upon receiving an object for caching from the cache 
manager 104, the cache device 105 determines whether the 
received object is to be stored in the cache device 105 or not. 
If the cache device 105 is not full, no replacement is needed 
and the object is stored in the cache device 105. If the cache 
device is full, replacement is needed and the cache device 
105 deterrnines which object is to be sent to the next cache 
device 105a based on its replacement policy. The replaced 
objects form the object input stream for the next cache 
device 105a. 

When the cache device 105a receives a replaced object 
from the cache device 105, it also determines if replacement 
is needed to cache the replaced object in accordance with its 
replacement policy. As described above, the two devices 105 
and 105a may employ entirely different policies for their 
replacement operations. If replacement is needed, the cache 
device 105a then keeps those objects that will help the 
device perform well based on its replacement policy. The 
replaced object is then sent to the next cache device. The 
process continues until no replacement operation is needed 
in one of the cache devices 105-105/j, or until the last one 
of the cache device 105/1 is reached. Replacement from the 
last cache device 105/t is removed from the cache 102. 
Throughout the process, the cache manager 104 is fully 
aware of which object is stored in which of the cache devices 
105-105/1. 

When an object is again accessed (i.e., on a cache hit), the 
object may be moved back up the chain (e.g., to the 
preceding cache device, to the initial cache device, or to 
some cache device ahead in the chain). This allows the 
popular objects to stay in the cache 102 longer while 
unpopular objects move to the end of the chain quickly and 
are removed from the cache 102. 

When the cache manager 104 receives a request for a 
cached object in one of the cache devices 105-105/j of the 
cache 102, the cache manager 104 can send the request to the 
cache device 105, regardless whether the requested object is 
cached in the cache device 105 or not. If the object is stored 
in the cache device 105, the object is fed back to the cache 
manager 104. If not, the cache device 105 retrieves the 
object from the subsequent one of the cache devices 
105-105/1 where the object is stored. The object is then sent 
to the cache manager 104 from the cache device 105. 
Alternatively, the cache manager 104 sends the request to 
the cache device that stores the requested object directly. 

Each of the cache devices 105-105n is refreshed in 
accordance with one of a number of replacement policies. 
Each of the cache devices 105-105/1 receives its respective 
replacement policy from the cache manager 104 via the bus 
109. Each of the cache devices 105-105/1 then conducts its 
own replacement operation in accordance with the replace- 
ment policy received. 

FIG. 5 shows another cache proxy 200 in accordance with 
another embodiment of the present invention. As can be seen 
from FIG. 5, the cache 205 is structured such that it 
encompasses the characteristics of the cache 72 of FIGS. 
2-3 and the cache 102 of FIG. 4. As can be seen from FIG. 
5, the cache 205 is partitioned into a number of storage areas 
206-206/1, each being comprised of a number of storage or 
cache devices. For example, the storage area 206 includes a 
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number of cache devices 300 through 300* connected being different from one another such that the storage 

together and the storage area 206* includes a number of can store objects of different kinds while offering 

cache devices 320-320* connected together. Each of the optimized performance across different performance 

cache devices in each of the storage areas 206-206/1 stores metrics at the same time, wherein the cache manager 

objects of one kind. The first cache device in each of the s mav ca Use more man one cac hed object in a storage 

storage areas 206-206* is connected to a cache manager area to t> c evicted in order for a single new object to be 

204. The cache manager 204 sends all objects for caching in stored in the same storage area, 

the storage area 206 to its first cache device 300. Similarly, 2. The cache system of claim 1, wherein when the cache 

the cache manager 204 sends all objects for caching in the manager receives an object from remote sites, the cache 

storage area 206a to its first cache device 310. 10 man ager determines to which one of the storage areas the 

Upon receiving an object for caching from the cache received object is to be cached based on which kind the 

manager 204, the first cache device in each of the storage object belongs to. 

areas 206-206* determines whether the received object is to 3 cache system of claim 1, wherein the cache 

be stored in the first cache device or not. For example, the manager dynamically changes the size of each of the storage 

cache device 300 of the storage area 206 determines if the ^ areas of the storage. 

object received from the cache manager 204 to the storage 4. The cache system of claim 1, wherein the cache 

area 206 is to be stored in the cache device 300. If so, the manager statically partitions the storage areas such that the 

cache device 300 stores the object. If not, the cache device s iz e 0 f eacn 0 f me storage areas remains unchanged after 

300 sends the object to the next cache device 301. The cache partitioning. 

device 301 then determines if the object is to be stored in the 2 o 5. The cache system of claim 1, wherein the cache 

cache device 301 or not. The process continues until the manager further comprises: 

object is fmaUy stored in the appropriate one of the cache a look ^ ^ mdicatcs whkh ^ of 

devices 300-300*. Tte caching and replacement process is objectfi afe stored ^ y ^ Qf ^ e 

the same as the process described above in connection with , , , , 

FIG. 4, which will not be described in more detail below. M a ^placement policy database that stores the plurality of 

When the cache manager 204 receives a request for a replacement policies; 

cached object in the cache 205, the cache manager 204 first a replacement controller that applies one of the replace - 

determines which of the storage areas 206-206* the request rnent policies to a corresponding one of the storage 

should be sent to. For example, if the cache manager 204 areas. 

determines that the request should be sent to the storage area 30 The cache system of claim 1, wherein the plurality of 
206, the cache manager 204 can send the request to the cache replacement policies are selected from a group comprising a 
device 300 of the storage area 206, regardless whether the Least-Recently-Used (LRU) replacement policy, a Least- 
requested object is cached in the cache device 300 or not. If Frequently-Used (LFU) replacement policy, a Greedy-Dual- 
the object is stored in the cache device 300, the object is fed Size (GDS) replacement policy, a LFU-Aging replacement 
back to the cache manager 204. If not, the cache device 300 35 V^icy, a LRU-Log (size) replacement policy, a Size replace- 
retrieves the object from the subsequent one of the cache mcnt policy, a Lowest-Latency-First replacement policy, a 
devices 300-300* where the object is stored. The object is Lowest-Relative- Value (LRV) replacement policy, a GDS- 
then sent to the cache manager 204 from the cache device Frequency replacement policy, and a LFU-Dynamic Aging 
200. Alternatively, the cache manager 104 sends the request replacement policy, 
to the cache device that stores the requested object directly. 40 7. A cache system, comprising: 

Each cache device in each of the storage areas 206-206* (A) a storage that includes a first and a second storage 

is refreshed in accordance with one of a number of replace- device coupled together, wherein each of the first and 

ment policies. Each cache device receives its respective second storage devices stores objects of different sizes 

replacement policy from the cache manager 204 via the bus received from remote sites and to be directed to target 

209. Each cache device in each of the storage areas 45 devices; 

206-206* then conducts its own replacement operation in (g) a cacne manager coupled to the first storage device to 

accordance with the replacement policy received. senc j received objects to the first storage device, 

In the foregoing specification, the invention has been wherein the cache manager causes cached objects in the 

described with reference to specific embodiments thereof. It f K t storage device to be replaced in accordance with a 

will, however, be evident to those skilled in the art that 50 first replacement policy, wherein the replaced objects 

various modifications and changes may be made thereto are sent to the second storage device, wherein the cache 

without departing from the broader spirit and scope of the manager causes cached objects in the second storage 

invention. The specification and drawings are, accordingly, device to be replaced in accordance with a second 

to be regarded in an illustrative rather than a restrictive replacement policy different from the first replacement 

sense. 55 policy such that the storage can store objects of differ- 

What is claimed is: en t kinds while offering optimized performance across 

1. A cache system comprising: different performance metrics at the same lime, 

a storage that is partitioned into a plurality of storage wherein the cache manager may cause more than one 

areas, each for storing one kind of objects of different cached object in one of the first and second storage 

sizes received from remote sites and to be directed to 60 devices to be evicted in order for a single new object to 

target devices; be stored in the same storage device, 

a cache manager coupled to the storage to cause objects 8. The cache system of claim 7, further comprising a third 

to be stored in the corresponding areas of the storage, storage device coupled to the second storage device to store 

wherein the cache manager causes cached objects in objects replaced from the second storage device, wherein the 

each of the storage areas to be replaced in accordance 65 cache manager replaces the objects stored in the third 

with one of a plurality of replacement policies, each storage device in accordance with a third replacement policy 

policy being optimized for one kind of objects and different from the first and second replacement policies. 
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9. The cache system of claim 8, wherein when the first 
storage device receives an object for caching and the first 
storage device is full, the first storage device passes a 
replaced object to the second storage device in accordance 
with the first replacement policy. 

10. The cache system of claim 9, wherein when the 
second storage device receives the replaced object for cach- 
ing and the second storage device is full, the second storage 
device passes a second replaced object to the third storage 
device in accordance with the secood replacement policy. 

11. The cache system of claim 10, wherein when the third 
storage device receives the replaced second object for cach- 
ing from the second storage device and the third storage 
device is full, the third storage device evicts a third replaced 
object in accordance with the third replacement policy. 

12. The cache system of claim 11, wherein when a cached 
object in the third storage device is requested by an access 
request in a cache hit operation, the object is moved back to 
the first storage device. 

13. The cache system of claim 8, wherein each of the first, 
second, and third replacement policies is selected from a 
group comprising a Least-Recently-Used (LRU) replace- 
ment policy, a Least-Frequently-Used (LFU) replacement 
policy, a Greedy-Dual-Size (GDS) replacement policy, a 
LFU-Aging replacement policy, a LRU-Log (size) replace- 
ment policy, a Size replacement policy, a Lowest -Latency- 
First replacement policy, a Lowest-Relative- Value (LRV) 
replacement policy, a GDS -Frequency replacement policy, 
and a LFU-Dynamic Aging replacement policy. 

14. A cache system, comprising: 

(A) a storage that is partitioned into a first and a second 
storage area, each including a first and a second storage 
device coupled together; 

(B) a cache manager coupled to the first storage device of 
each of the storage areas to cause objects to be stored 
in the corresponding areas of the storage, wherein the 
cache manager causes cached objects in the first storage 
device of the first storage area to be replaced in 
accordance with a first replacement policy, wherein the 
replaced objects are then sent to the second storage 
device of the first storage area and the cache manager 
causes the cached objects stored in the second storage 
device of the first storage area to be replaced in 
accordance with a second replacement policy different 
from the first replacement policy, wherein the cache 
manager causes cached objects in the first storage 
device of the second storage area to be replaced in 
accordance with a third replacement policy different 
from the first replacement policy, wherein the replaced 
objects are then sent to the second storage device of the 
second storage area and the cache manager causes the 
cached objects stored in the second storage device of 
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the second storage area to be replaced in accordance 
with a fourth replacement policy different from the 
third replacement policy. 

15. The cache system of claim 14, wherein the cache 
manager further comprises: 

a partition lookup table that indicates which object is 

stored in which of the storage areas; 
a replacement policy database that stores the replacement 

policies; 

a replacement controller that applies the corresponding 
one of the replacement policies to a corresponding one 
of the storage devices of the corresponding storage 
area. 

16. The cache system of claim 14, wherein each of the first 
and second storage areas further comprises a third storage 
device coupled to the second storage device to store objects 
replaced from the second storage device, wherein the cache 
manager replaces objects stored in the third storage device 
in accordance with a fifth replacement policy. 

17. The cache system of claim 14, wherein the storage 
further comprises a third storage area including a plurality of 
storage devices coupled together. 

18. The cache system of claim 14, wherein the replace- 
ment policies are selected from a group comprising a 
Least-Recently-Used (LRU) replacement policy, a Least- 
Frequently-Used (LFU) replacement policy, a Greedy-Dual- 
Size (GDS) replacement policy, a LFU-Aging replacement 
policy, a LRU-Log (size) replacement policy, a Size replace- 
ment policy, a Lowest-Latency-First replacement policy, a 
Lowest-Relative- Value (LRV) replacement policy, a GDS- 
Frequeocy replacement policy, and a LFU-Dynamic Aging 
replacement policy. 

19. The cache system of claim 14, wherein when the first 
storage device of one of the first and second storage areas 
receives an object for caching and the first storage device is 
full, the first storage device passes a replaced object to the 
second storage device in accordance with one of the first and 
the third replacement policies. 

20. The cache system of claim 19, wherein when the 
second storage device of one of the first and second storage 
areas receives the replaced object for caching and the second 
storage device is full, the second storage device determines 
a second replaced object in accordance with one of the 
second and fourth replacement policies. 

21. The cache system of claim 20, wherein when a cached 
object in the second storage device of one of the first and 
second storage areas is requested by an access request in a 
cache hit operation, the object is moved back to the corre- 
sponding first storage device. 
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